JUMP-DIFFUSION RISK-SENSITIVE ASSET MANAGEMENT I: 
DIFFUSION FACTOR MODEL 



MARK DAVIS* AND SEBASTIEN LLEOt 

Abstract. This paper considers a portfolio optimization problem in which asset prices are 
represented by SDEs driven by Brownian motion and a Poisson random measure, with drifts that 
are functions of an auxiliary diffusion factor process. The criterion, following earlier work by Bielecki, 
Pliska, Nagai and others, is risk-sensitive optimization (equivalent to maximizing the expected growth 
rate subject to a constraint on variance.) By using a change of measure technique introduced by 
Kuroda and Nagai we show that the problem reduces to solving a certain stochastic control problem in 
the factor process, which has no jumps. The main result of the paper is to show that the risk-sensitive 
jump diffusion problem can be fully characterized in terms of a parabolic Hamilton-Jacobi-Bellman 
PDE rather than a PIDE, and that this PDE admits a classical (C^'^) solution. 

1. Introduction. In this article, we consider a finite time jump-diffusion version 
of the risk sensitive asset management problem of Bielecki and Pliska [5] . Fundamen- 
tally, our main result is to show that the resulting stochastic control problem can 
be fully characterized by a parabolic Hamilton-Jacobi-Bellman PDE rather than a 
PIDE, and that this PDE admits a classical {C'^''^) solution. 

Risk-sensitive control is a generalization of classical stochastic control in which the 
degree of risk aversion or risk tolerance of the optimizing agent is explicitly parameter- 
ized in the objective criterion and influences directly the outcome of the optimization. 
In risk-sensitive control, the decision maker's objective is to select a control policy 
h{t) to maximize the criterion 

J{x,t,h-e) := -iluE \e-SF(x,h)'\ -^^ 

where t is the time, x is the state variable, is a given reward function, and the 
risk sensitivity > is an exogenous parameter representing the decision maker's 
degree of risk aversion. A Taylor expansion of the previous expression around 9 — Q 
evidences the vital role played by the risk sensitivity parameter: 

J(x, t, /i; 61) = E [F{x, t, h)] - ^Var [F{x, t, h)] + 0{e^) (1.2) 

This criterion amounts to maximizing E [F(x, t, h)] subject to a penalty for variance. 
For a general reference, see Whittle [39 . Much of the recent literature concerns the 
infinite time horizon problem: 

J^{x,h;e) := liminf-ii^^nE [e"^-''^'''''^] (1.3) 

This is interesting from a theoretical perspective, but is not applicable to practical 
asset management because of the non- uniqueness of controls. Optimality in this sense 
is a 'tail property': if h*{t) is optimal, then so is h{t) = h*{t)lt>T + h{t)lt<T for any 
arbitrary process h(t) and time T > 0. Of course, near-term decisions are the ones 
that are of primary importance to investment managers. 
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In the past decade, the apphcations of risk-sensitive control to asset management 
have flourished. Risk-sensitive control was first applied to solve financial problems 
by Lefebvre and Montulet [53] in a corporate finance context and by Fleming [IS] 
in a portfolio selection context. However, Bielecki and Pliska j5] were the first to 
apply the continuous time risk-sensitive control as a practical tool that could be used 
to solve 'real world' portfolio selection problems. They considered a long-term asset 
allocation problem and proposed the logarithm of the investor's wealth as a reward 
function, so that the investor's objective is to maximize the risk-sensitive (log) re- 
turn of his/her portfolio or alternatively to maximize a function of the power utility 
(HARA) of terminal wealth. They derived the optimal control and solved the asso- 
ciated Hamilton- Jacobi-Bellman (HJB) PDE under the restrictive assumption that 
the asset and factor noise are uncorrelated. This assumption is unrealistic and it was 
later relaxed (see [5]). The contribution of Bielecki and Pliska to the field is immense: 
they studied the economic properties of the risk-sensitive asset management criterion 
(see [7]), extended the asset management model into an intertemporal CAPM ([8]), 
worked on transaction costs ([6]), numerical methods ([4]) and considered factors 
driven by a CIR model ([9]). A major contribution to the mathematical theory was 
made by Kuroda and Nagai [23 who introduced an elegant solution method based 
on a change of measure argument which transforms the risk sensitive control problem 
in a linear exponential of quadratic regulator. They solved the associated HJB PDE 
over a finite time horizon and then studied the properties of the ergodic HJB PDE. 
Recently, Davis and Lleo [H] applied this change of measure technique to solve, at a 
finite and an infinite horizon, a benchmarked investment problem in which an investor 
selects an asset allocation to outperform a given financial benchmark. The problem 
we consider is also related to the vast literature on HARA utility maximization that 
has fiourished in the past 50 years. This literature includes a number of references re- 
lated to risk-sensitive control, such as works by Fleming and Sheu ([17]; [E] and [19) ) 
or Hansen and Sargent [22] in the context of robust control. 

Risk-sensitive asset management theory was originally set in a world of diffu- 
sion dynamics where randomness is modelled using correlated Brownian motions. To 
our knowledge, the only attempt to extend the risk-sensitive asset management the- 
ory from a diffusion to a jump diffusion setting was made by Wan [38] who briefly 
sketched a jump-diffusion extension of Bielecki and Pliska's [5| original infinite hori- 
zon risk-sensitive asset management model. Wan's treatment is however restrictive 
as it only considers a single Poisson process-driven jump per asset and assumes that 
the underlying valuation factor risks and asset risks are uncorrelated. Our paper ad- 
dresses these two limitations. The setting of our control problem, which takes place 
within a finite time horizon, allows for both infinite activity jumps in asset prices and 
for a correlation structure between factor risks and asset risks. To solve this con- 
trol problem we extend Kuroda and Nagai's powerful change of measure technique 
to account for the jumps. One of the difficulties we face in extending this technique 
is proving that the optimal control is admissible as this requires showing that the 
Doleans exponential (|2.9|) associated with this control is a martingale. In a pure dif- 
fusion setting, this would follow easily from the Kamazaki condition or the Novikov 
condition. However, when the Doleans exponential does not have continuous path, as 
is the case in a jump diffusion setting, proving that it is indeed a martingale is more 
difficult as only weaker partial results exist. This question is addressed in Appendix 
\X\ of the present paper. 

2 



In this paper the asset price processes are modelled as jump-diffusions whose 
growth rates are functions of an auxiliary 'factor' process X{t) which satisfies a lin- 
ear diffusion SDE. Our main result is that the risk-sensitive jump-diffusion asset 
management problem is equivalent to an optimal control problem for a diffusion pro- 
cess (no jumps) and that the HJB equation for the latter admits a unique classical 
C^''^ {[0,T) X M") solution. Showing the existence and uniqueness of a solution to 
a risk-sensitive control problem can prove difficult even in a pure diffusion setting. 
For example, Bensoussan, Freshe and Nagai [3] had to constrain the behaviour of the 
Hamiltonian in order to prove existence of a classical solution. Still in a pure diffusion 
setting, Fleming and Soner (see V.9 in [20]) proved that the value function is a contin- 
uous viscosity solution of the associated Hamilton Jacobi Bellman Partial Differential 
Equation (HJB PDF) but had to assume boundedness of all coefficients and of the 
derivatives of the reward function. No such strong condition is required to solve the 
jump diffusion problem considered in this article. In fact all our assumptions arise 
naturally from the structure of the risk-sensitive asset management problem. Unique- 
ness follows from a classical verification argument while the proof of existence relies 
on a policy improvement algorithm and on the properties of linear parabolic PDFs. 

The paper is organized as follows. We first introduce the general setting of the 
model in section 2 and define the class of random Poisson measures which will be 
used to model the jump component of the asset dynamics. In Section 3, we formulate 
the jump-diffusion control problem and introduce the change of measure argument of 
Kuroda and Nagai [27]. In a pure diffusion case, this is enough to transform the prob- 
lem into a standard Linear Exponential of Quadratic Regulator. In our jump-diffusion 
setting, the change of measure simplifies the problem by associating the HJB PDF 
given in Section 13.31 rather than the expected Partial Integro-Differential Equation 
containing non-local terms, to the value function. It is striking that an optimal con- 
trol problem for a jump-diffusion model has a solution that is characterized in terms 
of a HJB PDF and not a HJB PIDE0. 

Our main result is Theorem 14.31 in Section |31 The proof depends on various 
technical arguments which are given in Sections 5 to 7. In section 5, we show the 
existence of a unique optimal control before addressing two key questions in Section 
6. First, the admissibility of the optimal control is no longer a priori guaranteed be- 
cause the Doleans exponential defining the Radon-Nikodym derivative does not have 
continuous paths. This point is addressed in Propositions 6.3 and 6.4. Second, the 
Risk-Sensitive Hamilton- Jacobi-Bellman Partial Differential Equation (RS HJB PDF) 
contains a jump-induced control-dependent integral term: it is no longer possible to 
find an analytical solution and the existence of a strong, classical solution is no longer 
guaranteed. However, should we be able to prove the existence of a classical C^'^ 
solution to the RS HJB PDF, then we can prove uniqueness and resolve the control 
problem using a straightforward verification theorem, presented in Theorem 6.1 and 
Corollary 6.2 in Section 6. 

In Section 7 we address existence and regularity of solutions to the RS HJB PDF. 
We show, in Theorem 7.2 and Corollary 7.4, that the risk-sensitive jump-diffusion con- 
trol problem we consider admits a unique classical C^'^ ([0,^) x E") solution. Show- 
ing the existence and uniqueness of a solution to a risk-sensitive control problem can 



See 0ksendal and Sulem |42| for a treatment of jump-diffusion control problems. 
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prove difficult even in a pure diffusion setting. For example, Bensoussan, Frehse and 
Nagai [3] had to constrain the behaviour of the Hamiltonian in their finite time hori- 
zon problem to prove existence of a classical solution. Still in a pure diffusion setting 
and over a finite time horizon, Fleming and Soner (see V.9 in [2^) proved that the 
value function is a continuous viscosity solution of the associated Hamilton Jacobi 
Bellman Partial Differential Equation (HJB PDF) but had to assume boundedness of 
all coefficients and of the derivatives of the reward function. No such strong condition 
is required to solve the jump diffusion problem considered in this article. In fact all 
our assumptions arise naturally from the structure of the risk-sensitive asset manage- 
ment problem. We obtain our result by applying an approximation in policy space 
in a two-step process: first, we show existence on a bounded region and then extend 
to the unbounded state space. With this result in hand, we have all the ingredients 
needed for the proof of Theorem 14.31 

Up to this point, we have assumed that the factor process X{t) is directly ob- 
served by the controller, and therefore represents real economic factors: GDP growth, 
inflation, the S&P500 index, etc. We may however wish to use X{t) as an abstract 
latent factor, introduced to model volatility of returns, in which case only the prices, 
and not X(t), will be observed. In our final Section 8, we note that this problem, once 
adequately reformulated, can be solved using a classical Kalman filter, as in !33 , as 
the jump noise is absent from the dynamics of X{t). While this is from a technical 
point of view a simple observation, it greatly enhances the applicability of our results. 

In a companion paper [14] we consider the case in which there are jumps in both 
the price and factor processes. There the measure change technique does not remove 
the jumps and the argument is substantially different. 

2. Analytical Setting. 

2.1. Overview. The growth rates of the assets are assumed to depend on n fac- 
tors Xi{t), . . . , Xn{t) which follow the dynamics given in equation (j2.3|) below. As in 
Kuroda and Nagai's asset-only model, the assets market comprises m risky securities 
Si, z = 1, ... TO. In contrast to Kuroda and Nagai, we assume that the money market 
account process, 5*0, is an affine function of the valuation factors, which enables us 
to easily model a stochastic short term rate. Let M := n + m. Throughout, we will 
assume that m > n; this is needed in connection with the 'zero-beta' policies intro- 
duced in Section [7?T] 

Let {ft, {J-t} P) be the underlying probability space. On this space is defined 
an R*^-valued (J'tj-Brownian motion W{t) with components Wk{t), k = 1,...,M. 
Moreover, let be a (7^t)-Poisson point process on (0, oo) x Z, independent of W{t), 
where {Z,Bz) is a given Borel spac^l. Define 

5:^{U eB{Z),E[N{t,U)]<ooyt} (2.1) 

Finally, for notational convenience, we fix throughout the paper a set Zq G Bz such 



is a Polish space and Bz is the Borel cr-field. See Ikeda and Watanabe [23] for a formal 
definition of the Poisson point process 
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that i^(Z\Zo) < oo and define 

N{dt,dz) (2.2) 

N{dt, dz) - N{dt, dz) = N{dt, dz) - v{dz)dt -. N{dt, dz) if z e Zq 
N{dt,dz) ifzGZ\Zo 

2.2. Factor Dynamics. The dynamics of the n factors are expressed by the 
affine diffusion equation 

dX{t) = {b + BX{t))dt + KdW{t), X{Q) = X (2.3) 

where X{t) is the R"-valued factor process with components Xj{t) and h G R", 
B e M"""" and A e M"'^^'. 

2.3. Asset Market Dynamics. Let 5*0 denote the weahh invested in the money 
market account with dynamics given by the equation: 



So{t) 



= (ao + Al^X{t)) dt, 50(0) = so (2.4) 



where oq G M is a scalar constant, ^o G R" is a n-element column vector and through- 
out the paper x"^ denotes the transpose of the matrix or vector x. 

Let Si{t) denote the price at time t of the ith security, with i — 1, . . . , m. The 
dynamics of risky security i can be expressed as: 

dS^it) " 



^ia + AX{t)),dt + ^cr,kdWk{t)+ f j,{z)N{dt,dz), 
k=i -^z 

5,(0) = s„ i = l,...,m (2.5) 



where a G M™, A e M™><", S := [ct^j] , i = 1, . . . , m, j = 1, . . . , M and 7(2) G 
satisfying Assumption [TJ 

Assumption 1. j{z) g R™ satisfies 

- 1 < 7,""" < 7.(2) < 7™'^" < +00, * = 1, . . . , m 

and 

- 1 < 7f"" < < 7™-^" < +00, z = 1, . . . , m 
/or I = 1, . . . , m. Furthermore, define 

S := supp(z/) G 

and 

S :=supp(i^o7-i) G i3(M™) 

where supp(-) denotes the measure's support, then we assume that YiiLiilT'"'' ^ 
is the smallest closed hypercube containing S. 
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In addition, the vector-valued function j{z) satisfies: 

[ |7(z)pi/(dz) < oo (2.6) 

Note that Assumption [T] implies that each asset has, with positive probabihty, 
both upward and downward jumps. As wih become evident in Section |3.2[ the effect 
of this assumption is to bound the space of controls. Relation (j2.6p is a standard 
condition. 

Define the set J' as 

J:={/ieR'";-l-/i'^V<0 VtAgs} (2.7) 

and let J' be the closure of J'. For a given z, the equation hF"f{z) = — 1 describes a 
hyperplane in M™. J' is a bounded open convex subset of M™. 

2.4. Portfolio Dynamics. We will need the following assumptions: 
Assumption 2. EE^ > 

The effect of this assumption is to prevent redundant assets. For example, we 
will not able to model in our investment market a share and an option or futures on 
that share. However, this assumption leaves us free to model a wide range of assets 
such as shares, bonds and commodities products as well as related indices. 

Assumption 3. AA^ > 0. 

Let Qt '■= cr[[S{s),X{s)),Q < s <t)he the sigma-field generated by the security 
and factor processes up to time t. An investment strategy or control process is an 
K™-valued process with the interpretation that hi{t) is the fraction of current port- 
folio value invested in the ith asset, i — 1, . . . , m. The fraction invested in the money 
market account is then /io(i) = 1 — T^T=i ^ii^)- 

Definition 2.1. An -valued control process h[t) is in class Ti if the following 
conditions are satisfied: 

1. h{t) is progressively measurable with respect to {B{[0, t]) C>5 ^t}t>o '^^'^ cddldg; 

2. P \h{s)\^ds < +oo) =1, yt> 0; 

3. h^{t)-i{z) > -1, Vt > 0,z e Z, a.s. dv. 
Define the set K. as 

IC := {h en : h{t,uj) e J a.e. (dt x dP} (2.8) 

Lemma 2.2. Under Assumptions^ a control process hit) satisfying condition 3 
in Definition \2.1\ is bounded. 

Proof. The proof of this result is immediate. □ 

Definition 2.3. A control process h is in class A(T) if the following conditions 
are satisfied: 
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1. he U; 

2. Ex^ = 1 where x?; * € (0, T], is the Doleans exponential defined as 

Xt ■■= exp j-^* h{sfj:dWs - h{sfj:j:^h{s)ds 

ln{l -G{z,h{s)))N{ds,dz) 
{In (1 - G{z, h{s))) + G{z, h{s))} v{dz)ds 



J7, 

I 



J7, 



(2.9) 



G{z,h) = l- {\ + h^-i{z)) (2.10) 

We say that a control process h is admissible if h & -^(^) • 

The proportion invested in the money market account is ho{t) = 1 — Y^^i ^ii^)- 
Taking this budget equation into consideration, the wealth, V{t) of the investor in 
response to an investment strategy h{t) G %, follows the dynamics 

= (ao + A^X{t)) dt + h^{t) [a - uqI + {A- 1A^) X{t)) dt 

+h'^{t)J:dWt + J h'^{t)j{z)N{dt,dz) 

where 1 G denotes the m-element unit column vector and with 1^(0) = v. Defining 
a := a — cqI and A:= A — IAq , we can express the portfolio dynamics as 

= (ao + A'^X{t)) dt + h^{t) (a + Ax{t)) dt + h^{t)^dWt + f h^{t)-f{z)N{dt, dz) 

(2.11) 

with initial endowment 1^(0) = 0. 



3. Problem Setup. 

3.1. Optimization Criterion. We will assume that the objective of the in- 
vestor is to maximize the risk adjusted growth of his/her portfolio of assets over a 
finite time horizon. In this context, the objective of the risk-sensitive management 
problem is to find h* € A{T) that maximizes the control criterion 

J{x,t,h-e) := -iluE \e-O^^V(t,x,h)\ 



By Ito's Lemma, the log of the portfolio value in response to a strategy h is 
\nV{t) = lnv + {ao+A'^X{s))+h{sf(a + AX{s)^ds-^J^ /i(s)^EE^/i(s)ds 

+ / h{sfmWis) 



+ 



I I {\n{l + h{sf^{z))-h{sf^{z)]v{dz)ds 

Jo JZo 



+ / / + h{sfj{z))N{ds,dz) 
Jo Jz 
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(3.2) 



Hence, 



where 



(3.3) 



g{x, h) = -{e + 1) EE' h - aa - A'^ x - h' {a + Ax) 



(l + h^jiz)) -1 +h^j{z)lz,{^)\,y{dz) (3.4) 



and the Doleans exponential Xt is given by (|2.9p . 



3.2. Change of Measure. Let P/i be the measure on (HjJ't) defined via the 
Radon-Nikodym derivative 

^ - (3 5) 

For a change of measure to be possible, we must ensure that the following technical 
condition holds: 

G{z,h{s)) < 1 

This condition is satisfied iff 

h^is)"f{z) > -1 (3.6) 

a.s. dv, which was already required for h(t) to be in class H (Condition 3 in Defini- 
tion [2lT]). Condition p.6p is endogenous to the control problem and can be interpreted 
as a risk management safeguard preventing the investor from investing in some of the 
portfolios if the jump component of these portfolios could result in the investor's 
bankruptcy. 

Observe that P/i is a probability measure for h G A{T). Then, 

Wl'^Wt+e f ^^h{s)ds 
Jo 

is a standard Brownian motion under the measure P/i and we have (recall the notation 
defined at ([22])) 

N^{ds,dz)^ [ [ N{ds,dz)~ [ [ {1-G{z,h{s))}v{dz)ds 
JZo JQ JZq Jo Jzo 

t n nt 



N{ds,dz)~ / |(l + /i^7(z)) '^}j/(d2)ds 



/O JZo Jo JZo 

As a result, X{t) satisfies the SDE: 

dX{t) = / {X{t), h{t)) dt + AdWf , t £ [0, T] (3.7) 

where 

J{x,h):={h + Bx-eMfh) (3.8) 

We will now introduce the following two auxiliary criterion functions under the 
measure P^: 



the auxiliary function directly associated with the risk-sensitive control prob- 
lem: 



eicple g{Xs,h{s))ds-9lnv\ 



(3.9) 



where Et^x [•] denotes the expectation taken with respect to the measure P/i 
and with initial conditions (i, x). 
• the exponentially transformed criterion 



iiv,x,h;t,T) :=Ef_, 



exp < 9 / g{s, Xs, h{s))ds — 9\nv 



(3.10) 



which we will find convenient to use in our derivations. 



We have completed our reformulation of the problem under the measure P/j. It is 
striking that the asset allocation problem with jump-diffusion asset prices reduces to 
a stochastic control problem for a dijfusion process, with dynamics (j3.7l) and reward 
function dH]) or (|XTO)) . 

3.3. The Risk-Sensitive Control Problems under P/j. Let $ be the value 
function for the auxiliary criterion function I{v, x; h; t, T). Then $ is defined as 



$(t, a;)= sup I{v,x;h;t,T) 

heA{T) 



We will show that $ satisfies the HJB PDE 



where 



9$ , 

— {t, x) + sup Ll<f{t, x)=0, {t, x) G (0, T) X 



L'i<^{t,x) = {h + Bx-9KY.^h{s)) £>$ 

1 9 
tr {kk^D'^<^) - -{D<^>fAA^D^ - g{x, h) 



(3.11) 



(3.12) 



2 ^ '2 
and subject to terminal condition 

$(r,a;) = ln?;, x G R" 



(3.13) 



(3.14) 



Similarly, let $ be the value function for the auxiliary criterion function /(w, x; h] t, T). 
Then $ is defined as 



$(t,x)== inf I{v,x;h;t,T). 

heA{T) 



The corresponding HJB PDE is 



(i, x) + hi (kk^D^^it, x) ] + H{t, X, D$) = 0, 



at 2 
and subject to terminal condition 



$(T,a;) = y- 
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(3.15) 



(3.16) 



(3.17) 



and where 



H{x,r,p) = ini {f{x,h)'^p + 0g{x,h)r} 

for r e R, p e K" and in particular, 

^{t,x) = exp{-6'$(i,x)} 



(3.18) 



(3.19) 



Note that since $ and $ are related through a strictly monotone continuous 
transformation, an admissible (optimal) strategy for the exponentially transformed 
problem is also admissible (optimal) for the risk-sensitive problem. 

4. Main Result. In this section, we present the main result of this article, 
namely that the risk-sensitive jump diffusion problem admits a classical (C^'^) solu- 
tion, and show that the value function $ is convex in x. 

Proposition 4.1. The value function ^{t,x) is convex in x. 
Proof. 

To prove that the value function $(i, x) is convex in x, it is necessary and sufficient 
to show that V(xi,a;2) S M" and for any k e (0, 1), 



$(i, KXl + (1 - k)X2) < K^{t, Xl) + {1 - K)^{t, X2) 

Start from the left hand side: 



(4.1) 



^{t, KXl + (1 - k)X2) 

sup -^InEj'' 



exp{6 g{X„h{s))ds-einv}x{t) 



sup -ilnEj''(^^ ) 
heA{T) 

sup -ilnE,''(^^ ) 



exp|6i j g{KXi{s) + {l- K)X2{s),h{s))ds-e\nv'^x{t) 
exp IkoJ^ g{Xi{s),h{s))ds 



+(1 



giX2{s),h{s))ds-einv}xit) 



= sup --lnEj^(^^ ) 



X exp ■ 



< sup --ln<^E^_,^ 
heA{T) ti I 



expje*^ g{Xi{s),h{s))ds-einv^x{t) 
ej^' g{X2{s),h{s))ds - e In v"^ X{t)j 

expW g{Xi{s),his))ds-einv\x{t) 



exp<;0/ g{X2{s),h{s))ds-einvjx{t) 
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sup 

heAiT) 



-^InE? 



e^ple giX,{s),h{s))ds~einv\x{t) 



< sup -ilnEj'^^ (expit 
heA(T} p LV I 



exp<^6' / g{X2{s),h{s))ds - 9lnv\ xit) 



g{X,{.s),h{.s))d.s-9lnv}xit) 



sup -ilnEj' 



exp<^0 / 5(^2(5), /i(s))ds-01n«W(i) 



< sup -JlnE^^,^ 



exple g{Xi{s),h{s))ds ~e\nv\ x{t) 



1 - K 



sup 



InE? 



exp<^0 / g{X2{s),his))ds-0lnv}x{t) 



= + (1 - K)^{t,X2) 

where the fourth equahty follows from the fact that the covariance of two random 
variables inside the expectations is positive and the third inequality is due to the fact 
that the function a; >H> a;" for a; > and a G (0, 1) is concave. □ 

Corollary 4.2. The exponentially transformed value function <i> has the follow- 
ing property: V(xi,X2) gK^,kG (0,1,), 



+ (1 - k)x2) > ^''{t,Xl)^^-''{t,X2) 



(4.2) 



Proof. The properties follows immediately from the definition of $ = — ^ In $. □ 

We now come to the main result of this article. Recall the standing assumptions: 
in Section [21 Assumption [T] is a condition on the support of the jump measure. As- 
sumptions [5] and [3] are the non-degeneracy conditions SS-^ > 0, AA^ > 0, while 
Assumption m introduced in Section [741 below, is a full-rank condition on the matrix 
A defined at ([24l|) . 

Theorem 4.3. Under Assumptions{^^the following hold: 

1. the optimal asset allocation is the unique maximizer of the supremum (|3.12p 

2. # is the unique C^^^ {[0,T] X K") solution of the RS HJB PIDE ([XTC1) - ([XT7|) . 
Moreover, $ satisfies the property (|4.2I) 

3. $ is the unique C^-^ ([0,T] x M") solution of the RS HJB PIDE ([XT^ - ([m|) . 
Moreover, $ is convex in its argument x. 

Proof. The proof is based on a series of results proved in Sections [SH71 below. 
These combine to give us the following argument. 

Existence of an optimal control - by Proposition 15.11 the supremum in (I3.12p 
admits a unique Borel measurable maximizer. Moreover, by Proposition 16.31 this 
maximizer is admissible and by Proposition 16 .41 it is also a maximizer with respect to 
the P- measure criterion J defined in (j3.1l) . Thus, we can take this maximizer as our 

11 



optimal asset allocation. 

Existence of a classical (C^'"^) solution - by CoroUarv rOl $ is a C^'^ ([0, T] x E") 
solution of the RS HJB PDE (l3lB - (l3Tfl) . 

Uniqueness of the classical solution - the existence of zero beta policies enable 
us to deduce (as in Step 1 of the proof of Theorem 17. 2|) that $ is bounded. Part (i) . 
of Verification Theorem 16 . 1 1 therefore applies. Choosing as optimal control the unique 
maximizer of the supremum in p.l2p . part (ii). of Theorem 16 . 1 1 also applies: $ is the 
unique solution to the HJB FIDE. Property (|4.2p is proved in Corollary 14.21 

By Corollaries 17.31 and 16. 2[ it then follows that $ is the unique classical solution 
to the HJB PIDE (|3.12p with terminal condition p.l4p . Moreover, $ is convex in its 
second argument x. 



This result proves that we have solved our original control problem in the context 
of strong, classical solutions. What would this imply in terms of weaker viscosity 
solutions? As a classical solution is also a viscosity solution, our result implies that 
the value function is indeed a viscosity solution of the HJB PDE. However, uniqueness 
of classical solutions does not necessarily imply uniqueness of viscosity solutions. To 
prove uniqueness in the viscosity sense, we would need a comparison result such as 
Theorem 33 in Davis and Lleo [T5] . 

In the remainder of this paper, we develop the various technical arguments re- 
quired in the proof of Theorem 14.31 

5. Existence of a MsLximizing Control 

. Proposition 5.1. Under Assumption\^ the supremum in (|3.12p admits a unique 
Borel measurable maximizer h(t,x,p) for {t,x,p) G [0,T] x M" x M". Moreover, the 
maximizer h{t,x,p) is an interior point of the set J . 

Proof. The supremum in (j3.12p can be expressed as 



□ 



sup L'I^ 

heJ 




(5.1) 



Under Assumption [21 for any p E M" the terms 




h^-l{z)lxo{z)v(dz) 



and 
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are both strictly concave in ft, Vz e Z a.s. dv. Therefore, the supremum is reached for a 
unique maximizer h(t, x,p), which is an interior point of the set J^, which is the closure 
of the set J7 defined in equation ()2.7p . and the supremum, evaluated at h{t, x,p) £ R", 
is finite. By measurable selection, h can be taken as a Borel measurable function on 
[0,T] X M" X M". 
□ 

6. Verification Tiieorems. In this section, we prove a verification theorem to 
the effect that if (|3.12|) has a C^'^ solution then that solution is equal to $ defined 
by (|3.11l) and the control h*{t) = h{t, x, is optimal. We will first prove a verifica- 
tion theorem for the exponentially transformed problem p.l5|) with HJB PDE p.l6p 
and value function As a corollary, we will obtain a verification theorem for 

the risk sensitive control problem with HJB PDE p.l2|) and value function 

^{t,x). Define the first order operator 

L''(p{t, x) ^ {b + Bx- OKYFKf DLp{t, x) + eg{x, h)(p{t, x) (6.1) 



Theorem 6 . 1 (Verification Theorem for the Exponentially Transformed Control 
Problem). Let ^ be a C^'^ ([0,T] x R") bounded function. 

(i) Assume that 4>{T,x) < e"*'"" Vx G M" and 

^it,x) + ^tr(^AA^D^^{t,x)j + H{t,x,4>,D}) > 

on [0,T] X R", then ^{t,x) < ^{t,x) y{t,x) G [0,T] x M" 

(ii) Further assume that 4>{T,x) — e^^''^" Vx £ R" and there exists a Borel- 
measurable minimizer h*{t,x) of h^-^ L'^cf) defined in (|6.1I) such that 

^{t, x) + ^tr {KK^D'^4){t, x)^ + H{t, x, 4>, D4>) 

= ^{t,x) + ^tr[AA^D^4>{t,x)) +L^'4> 
= 

and the stochastic differential equation 

dX{t) = {b + BX{t) - eAYFh{t)) dt + AdWf 

defines a unique solution X{s) for each given initial data Xf — x and the process 
7r*(s) := h*{s,X{s)) is a well-defined control process in A{T). Then (/) = $ and 
7r*(s) is an optimal Markov control process. 
Proof. The following proof is based on an argument used by Touzi [37) . 
(i). Let h G A{T) be an arbitrary control, with X{t) the state process with initial 
data X(t) = X. Define the stopping time 

TN -.^T A inf {s > t : \Xs - x\ > N} 

Define Z{s) — 9 g{s,Xs,hs)ds, then 

d(e^=) :=0.g(s,X,,/i,)e^= 
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Also, by Ito, for s G [t,Ts], 



d4> 



where £ is the generator of the state process X{t) defined as: 

jC4>{t, x):={b + Bx- eAT,'^h{s)) D4>+^tr {kK^D'^Pj 



By the Ito product rule, and since dZg ■ (j)g = 0, we get 

and hence for s € [t, tn] 

4>{s, Xs)e^^ = 4>{t, x)e^' +e 4>{u, Xu)g{u, X„, /i„)e^" 



du 



Because for an arbitrary control h, 

- {t, x) + ^tr (AA^L>2^(f, a;)) + Xt) 



dt 



> 



dt 
> 



{t, x) + itr (^AA^ D^(j){t, x)^ + H{t, x, 4>, D(j)) 



and e^' > Vs e [t, tn] we have 

4>it,x)e^' <4>{s,Xs)e^' + D4>^AdW^ 
Taking the expectation, we obtain 



In particular, take s — t^^ and note that e^* — 1, then 



Since </> is assumed to be bounded, there exists a constant Ci > such that: 
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Since for an arbitrary admissible control h e A{T) and fixed s e [t, T] there 
exists some constant C2 > such that 



9{s,X,,K) <C2\1+X{s)\ 
Then 

for C3 = Cie^2 and C4 = Cge^'C^-*). 

By the dominated convergence theorem and the assumption that (j){T,Xt) < 



< E 



We have now proved the first part of the theorem, 
(ii). To prove the second part, we can simply apply the same reasoning for the 
optimal control h* . Note, however, that with this choice of control we would 
have 

^(t, x) + itr (^AA^D^4>{t, x)^ + H{t, x, 4>, D4>) 
= ^{t, x) + itr [aA^ D^{t, x)) + L^'^ 



= 



which would lead us to equality in the last equation, i.e. 



1 g{u,X^,'h^)du _ Qi-^y 



Corollary 6.2 (Verification Theorem for the Risk- Sensitive Control Problem). 
Let 4> be a C^'^ ([0, T] x R") n C ([0, T] x R") bounded function, 
(i) Assume that (t>{T,x) < g-^'"" Vx £ R" and 

^ + sup >0 
ot hej 

on [0,r] X R", then <j){t,x) < ^{t,x) V(i,x) G [0,T] x R" 
(ii) Further assume that (f>{T,x) = e"^'""" Vx G R" and there exists a minimizer 
h*{t,x) of h L^(f> defined in (I3.f 31) such that 

^ + sup L^,^{t, X{t)) = ^ + L'f4>{t, X{t)) = 
at at 
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and the stochastic differential equation 

dX{s) = (5 + BX{s-) - eKTFh{s)) ds + AdW^ 

defines a unique solution X for each given initial data Xf = x and the process 
7r*(s) :— h*{s,X{s)) is a well-defined control process in A{T). Then </) = $ and 
7r*(s) is an optimal Markov control process. 
Proof. This corollary follows from equation (|3.19p and from the fact that an 
admissible (optimal) strategy for the exponentially transformed problem is also ad- 
missible (optimal) for the risk-sensitive problem. □ 

Proposition 6.3. The process h*{t) is admissible: h*{t) e A{T). 

Proof. Refer to Appendix |X] for a full discussion and a proof of this proposition. 

□ 

Applying Proposition [631 we deduce that the control h*{t) is optimal for the aux- 
iliary problems (13. 9p and (I3.10p resulting from the change of measure. However, this 
proposition is not sufficient to conclude that h*{t) is optimal for the original prob- 
lem (|3.ip set under the P-measure. The next result show that this is indeed the case. 

Proposition 6.4. The optimal control h*{t) for the auxiliary problem 

sup I{v,x;h;t,T) 

heA{T) 

where I is defined in p.9p is also optimal for the initial problem swpf^^j^^f^rp^ J{x, t, h) 
where J is defined in (|3.1|) . 
Proof. See Appendix [K\ □ 

7. Existence of a Classical Solution. Historically, proving the existence of 
a strong, analytical solution to the HJB PDE was both the main difficulty and the 
main objective when solving a control problem. Fleming and Rishel [16j as well as 
Krylov [23] and [15] have been the main contributors, proposing techniques based 
either on PDE arguments or on probability theory. Recently however, the emphasis 
has switched from strong solutions to weaker types of solution. Viscosity solutions 
have proved particularly useful and successful, gaining many applications in stochas- 
tic control theory (see for example the classic article by Crandall, Ishii and Lions [TT] 
as well as Fleming and Soner PU] for their applications to stochastic control). The 
reason for this appeal is twofold. First, it is significantly easier to prove the existence 
of a viscosity solution than a classical solution. In the viscosity world, the difficulty 
is shifted from proof of existence to proof of uniqueness, and even then it is gener- 
ally easier to prove uniqueness of a viscosity solution via a comparison theorem than 
the existence of a classical solution. Second, the stability result due to Barles and 
Souganidis pi connects directly viscosity solutions to numerical methods, making it 
easy to solve 'real world' control problems. 

This section follows similar arguments to those developed by Fleming and Rishel [16] 
(Theorem 6.2 and Appendix E). Namely, we use an approximation in policy space 
alongside results on linear parabolic partial differential equations to prove that the 
exponentially transformed value functions $ is of class C^'^{{0,T) x R"). Then it 
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follows that the value functfons $ is also of class C^^^((0,r) x M"). The approxima- 
tion in policy space algorithm was originally proposed by Bellman in the 1950s (see 
Bellman [T] for details) as a numerical method to compute the value function. Our 
approach has two steps. First, we use the approximation in policy space algorithm 
to show existence of a classical solution in a bounded region. Then, we extend our 
argument to unbounded state space. To derive this second result we follow a different 
argument than Fleming and Rishel |16j which makes more use of the actual structure 
of the control problem. 

Our interest in classical solutions is as much mathematical as practical. First, 
since a smooth solution is a viscosity solution but the converse is not necessarily true, 
we are proving a stronger result. Second, this stronger result immediately translates 
a better grasp of the analytical properties of the value function. While viscosity 
solutions provide continuity, they do not generally give information about higher 
order derivatives. By contrast, classical solutions are smooth in the state, implying 
that they are (at least) in time and in the state. Third, viscosity solutions 
are purely about solving the PDF and although they show that the value function is 
the unique solution of the HJB PDF they do not prove directly the control problem 
has a solution, that is a pair of a value function and an admissible optimal control. 
Fourth and finally, in our case seeking a strong solution does not impair our search 
for numerical results. Because our state process X{t) can clearly be interpreted as 
the continuous time limit of a Markov Chain, we can apply well-known results by 
Kushner and Dupuis |26j to prove convergence of a finite approximation scheme to 
the value function. We can therefore solve concrete portfolio selection problems quite 
directly. 

7.1. "Zero Beta" Policies. In this section, we introduce a new class of control 
policies: the "zero beta" (0/3) policies: 

Definition 7.1 (0/3-policy). By reference to the definition of the function g in 
equation (|3.4p . a 'zero beta' (0/3) control policy h(t) is an admissible control policy 
for which the function g is independent of the state variable x. 

The term 'zero beta' is borrowed from financial economics (see for instance 
Black [10 ). To avoid assuming the existence of a globally risk- free rate in factor 
models such as the CAPM, the APT or in ad-hoc valuation models, it is customary 
to build portfolios without any exposure to the factor (s) as a substitute for the risk- 
free rate. These special portfolios are referred to as 'zero beta' portfolios by reference 
to the slope coefficient (3 used to measure the sensitivity of asset returns to the valu- 
ation factor (s). 

In the risk sensitive asset management model, if = 0, then the policy h'^ = 0, 
i.e. invest all the wealth in the risk-free asset, is a 0/3-policy. When ^ 0, we see 
from (|2.1ip that a 0/3-policy can exist only if there is a vector h satisfying 

h^A = -Ao. (7.1) 

We introduce the following standing assumption. 
Assumption 4. The matrix A has rank n. 

Under this assumption there are always 0/3-policies: we have only to take a vector 
h satisfying (|7.ip and scale it if necessary so that h ^ J (see (12. 7p ) and then h{t, uj) — h 
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is a 0/3-policy. We have no reason to consider anything other than the set Z of 
constant pohcies of this kind. Note that when h £ Z the function g of (|3.4[) is a 
constant, g{x, h) = g. 

7.2. The L'^{K) and J^'^{K),l < 7] < oo Spaces. The following ideas and 
notations relate to the treatment of linear parabolic partial differential equations found 
in Ladyzhenskaya, Solonnikov and Uralceva [28] . The relevant results are summarized 
in Appendix E of Fleming and Rishel. They concern PDEs of the form 

^ + itr (a(t, x)D^iJj) + b{t, x)^DxP + ec{t, x)^; + d{t, x) = (7.2) 

on a set Q = {0,T) x G and with boundary condition 

i>{t,x) = ^rix) X eG 

ipit, x) = x) {t, x) e (0, T) X dG 

The set G is open and is such that dG is a compact manifold of class C^. Denote by 

• d*Q the boundary of Q, i.e. 

d*Q := {{T} X G) U ((0,T) x dG) 

• L^{K) the space of 77-th power integrable functions on if C Q; 

• ll'll^./f the norm in L''(if). 

Also, denote by ^''(Q), 1 < < cx) the space of all functions ip such that -0 and 
all its generalized partial derivatives are in L^{K). We associate with this space the 
Sobolev-type norm: 



I (2) 



dt 



i=i 



I 

I dxi 



ri,K 



E 



(7.3) 



We will also introduce additional notation and concepts as required in the proofs. 



7.3. Existence of a Classical Solution. In this section, we use an approxima- 
tion in policy space to show the existence of a C^'^ solution to the RS HJB PDE p.l2p . 

Theorem 7.2 (Existence of a Classical Solution for the Exponentially Trans- 
formed Control Problem) . The RS HJB PDE (|3.16p with terminal condition $(T, x) = 
e"^'"" has a solution $ e C^ '^ ((0,T) x R") with $ continuous m [0,T] x M". 

Proof. Step 1: Approximation in policy space - bounded space 

Consider the following auxiliary problem: fix i? > and let be the open n- 
dimensional ball of radius R> centered at defined as :— {x € M."^ : \x\ < R} . 
We construct an investment portfolio by solving the optimal risk-sensitive asset al- 
location problem as long as X{t) G ^r for R > 0. Then, as soon as X{t) ^ ^r, 
we switch all of the wealth into the 0(3 policy h from the exit time t until the end of 
the investment horizon at time T. The HJB PDE for this auxiliary problem can be 
expressed as 

^ + itr (AA'^(t)i:>2|>) + H{t, x, !>, D^) = V(t, x) e Qr := (0, T) x .^r 

(7.4) 
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where, as in (|3.18|) 



H{x, r,p) = mi_{f{x, h)'^p + 9g{t, x, h)r} 

hej 

for p e M" and for g and / defined in (|3.4|) and p.8|) respectively. J^, defined as the 
closure of ^7 is a compact set. The boundary conditions are 

x) = x) V(t, x) e 9*gfl ((0, T) x U ({T} x 

with 

• *(r,a;) = e^^i"" Va; G ^i?; 

• *(i,a;) := ipit,x) := e^^iT-t) v(i,x) £ (0,r) x O^Sr and where h is a fixed 
arbitrary 0/3 policy which is constant as a function of time. Note that ip is 
obviously of class C^'^{Qr) and that the Sobolev-type norm 



\<lQn = m'X (7-5) 



is finite. 



Define a sequence of functions |)\ l>2,... on Qr = [0,7] X i^fl and of 

bounded measurable feedback control laws h'^, h^,... h^,... where h'^ is an arbitrary 
control. Assuming h^~^ is known, we define the function $^^+1 as the solution to the 
boundary value problem: 



itr (AA^(t)i:>2^fc^ _^ + eg{t, x, h''-^)^'' = 

(7.6) 



subject to boundary conditions 

$(i, a;) = a;) V(i, x) e 9*gfl := ((0, T) x 9^^) U ({T} x ^j^) 



Note that the boundary value problem ()7.6p is a special case of the generic problem 
introduced earlier in equation (|7.2p with 

a{t,x) = AA'^(t) 
6(i,x) = /(x,/i'=) 
c(i,a;) = g{t,x,h'') 
d{t,x) = 

Moreover, since ^r is bounded and J' is compact, all of these functions are also 
bounded. Thus, based on standard results on parabolic Partial Differential Equa- 
tions (see for example Appendix E in Fleming and Rishel [16 , and Chapter IV in 
Ladyzhenskaya, Solonnikov and Uralceva [21]), the boundary value problem (|7.6p ad- 
mits a unique solution in ^^^{Qr). 

Next, for almost all (t, x) £ Qr, k — 1,2, . . ., we define by the prescription 

h'' = Argmin^gj Kf D^^ + Ogit, x, h)¥^ (7.7) 
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so that 

fix, h^fD^ + eg{t, X, h^)^^ = in£ (/(x, Kf D^^ + ^^(t, h)^^\ 

= (7.8) 



Note, in view of the definition of g in (|3.4p . that the minimum is never achieved 
on the boundary of J', i.e. /i*^ takes values in J'. 

Observe that the sequence ( ) is globaUy bounded. Indeed, by Feynman- 

V / ken 

Kac, the sequence ( ) is bounded from below by 0. By the optimality principle, 

V / ken 

it is also bounded from above by e^-^* 9{x{s),h)ds _ g9g{T-t)^ Moreover, these bounds 
do not depend on the radius R and are therefore valid over the entire space (0, T) x ffi.". 

Step 2: Convergence Inside the Cylinder (0,r) x 3§r 

Step 2.1: Monotonicity of the Sequence 

Take fc > 1. Subtracting the PDE for l>'''+^ from the PDE for we see that 



dt 



dt 



1 

-tr 
2 



AA' {t)D^^ 



2^fc+l 



AA' (t)!)^**^ 



k+l 



= in(0,r)xM" 
with $'=+1 - I)'' = on W\ 

Add and subtract f{x, h'')'^ D^^ + eg{t, x, h'')^'', 



k+l 



dt 



1 

-tr 
2 



AA^(t)I?2|.fc+i) ^ (AA^(t)I?2|.fc 



f{x, h^y D<i>''+^ - fix, h"-^)'^' D'^)" ) + 0{ g{t, X, h'')^''+^ - g{t, x, 



+ (/(x, h^fD¥ + 9g{t, X, 
in (0, T) X R" 



(/(x, h'^fD^^ + eg{t, X, h'')^'' 



Rearranging, 



2*^ 



(AA^(Oi?'$'=+i) - (AA^(Oi?'$' 



\ 9t at y 

+/(x, /i'^')^ (d^'^+i - Dd''^) + 6ig(t, X, /i*^) (d'^+i - 



+ (^/(x, h^fD¥ + 6l.g(t, X, 
in (0, T) X M" 



/(x, /i'^-i)'^!?!-'^ + eg{t, X, h''-^)¥^ 
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Define the function &{t,x) as 

tit.x) := (^f{x,h^f d¥ + eg{t,x,h^)^^^ - (^f{x,h^-^f D^^ + eg{t,x,h*'-^)^^'^ 

By the definition of given in dTT]), t{t, x) < V(t, x) £ [0, T] x M", Vfc £ N. Define 
the sequence of functions (W^'^)fcGN as 

y^k ._ |,fc+i _ ^fc 

then satisfies the PDE 



+ -tr {Kk^{t)D^W^) + /(x, h^fDW^ + 6'.g(t, a;, /i'=)T/F'= + t^{t, x) = Q 



dt 2 

(7.9) 

in (0, T)x3§R, and with boundary condition W^{T, x) = Q on d*QR, = ((0, T) x BSSb)^ 
{{T] X i^^). 

Define the stopping time tq as the first exit time from SSr: 

TG:^ini{t:X{t)iG} 

By a standard Feynman-Kac representation, W''{t,x) can be represented by the ex- 
pectation 



W''{t,x) = E 



T/\TG 



(7.10) 



Because £(i, x) < 0, W^{t, x) < for fc > 1 and hence by definition of , 

$fc > Vfc e N 

which implies that the sequence > is non increasing. 

L J ken 

Step 2.2: Convergence of the Sequence 

Since the sequence ($'^)fceN is non increasing and is also bounded, it converges. Denote 
by $ its limit as fc — >■ oo. Now, since the Sobolev-type norm is bounded 

for 1 < 77 < oo, we can apply the following estimate given by equation (E.9) in 
Appendix E of Fleming and Rishel 

m]^: <Mn\\^'^\\%^ (7.11) 
for some constant Mn (depending on R) and where 

n + 2 



11 = 1 



V 
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and 



\^%= sup sup 



(t,x)&Qu {x,y)eG ^1'' 

0<t<T 

|$''(s,x) - ^''it,x)\ 

0<s,t<T 



to show that the Holder- type norm |$''|q^'^ is bounded. As A: ^ c» we conclude that 

• converges to _D<& uniformly in L^^{Qji) ; 

• D^^'' converges to weakly in L^{Qr) ; and 

• converges to ^ weakly in L'^{Q]i). 

Step 2.3: Proving that $ G ^(q^) 
Using estimate ()7.1ip . we see that 1^*^10^^ is bounded for /i > 0, which implies that 
77 > n + 2. Using relationship (|7.8p and then equation (|7.6p . we get: 



at 2 



itr (AA'^(t)i:>2|,fc^ _^ KfD^^ + 9g{t, x, h)^'' 



"(AA^(t)i^2|,fc^ _ (AA^(t)I?2|.fc+i 



-tr 



\ dt dt \2 
+f{x, h^ f {p¥ - d¥+^^ + 6g{t, X, h^) - l>'=+i) (7.12) 

for any admissible control h. 

Since the left-hand side of (|7.12p tends weakly in L^{Qr) to 

^ + itr (AA^(t)i?2$) + /(a;, hfo^ + eg{t, x, h)^ (7.13) 

as fc — > 00 and the right-hand side tends tends weakly to 0, then we obtain the 
following inequality 

^ + itr (kk^{t)D^^) + fix, hfD^ + 0g{t, x, h)^ > 



dt 2 
almost everywhere in Qr. 

Using a measurable selection theorem and following an argument similar to that of 
Lemma VI. 6.1 of Fleming and Rishel [16,, we see that there exists a Borel measurable 
function h* from (0, T) x into J (in fact J) such that 

fix, h*fD^ + Ogit, X, h*)^ = in£ (/(x, hf + Ogit, x, h)^\ 
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holds for almost all {t,x) G (0,r) x ^fl.. Then 



dt 



tr ( AA'^ {t)D^^'' ] +f{x, h*fD^'' + 9g{t, x, /i*)l>*^ 



1 



(^AA^{t)D^^''^ + fix, h'^fD^^ + eg{t, x, h'')^^ 



1 



\ dt dt 
+f{x,h^f [d^^ -D¥ 



AA' - AA^ {t)D^^ 



-^)+0g{t,x,h'')(^ 



fe+i 



Since the left-hand side of (|7.14p tends weakly in L'^{Qn) to 

5 + itr (AA^(i)i^'l') + fix, h*fD^ + eg{t, x, h*)^ 



(7.14) 



dt 2 

as fc — > oo and the right-hand side tends weakly to 0, then we obtain the inequality 

^ + itr {aA^ {t)D'^^^ + fix, h*fD^ + 9git, x, h*)^ < (7.15) 
almost everywhere in Qn. 

Combining (|7.13p and (|7.15p . we have shown that 

5 + itr (AA'^it)D^^) + fix, h*fD^ + Bgit, x, /i*)$ = 



dt 2 
almost everywhere in Qj^. 



Hence, (f> is a solution of equation p. 161) on a bounded domain. Moreover, 
$ e >C''(Q_R,). Also, since H is locally Lipschitz, |^''|q^ < oo for ^ > and 
\D^^\'^^ < oo for /i > 0, then \H{t,x,^'',D^'')\'^^ < oo. 



We can now show that $ G C^'iQn)- Define 



dt 



Iq^ 



Consider the following estimate given by equation (ElO) in Appendix E of Fleming 
and Rishel 



<i>|nt''<M2|l<i>||Q" 



(7.16) 



for some constant M2, and two open subsets Q' and Q" of Q such that Q' C Q". In 
this estimate, set Q" = Qn and take Q' to be any subset of Q such that Q' C Q. 
Thus 



(7.17) 



When interpreted in light of estimate ()7.1ip (stemming from (E9)), we see that the 
derivatives and ^rf- satisfy a uniform Holder condition on any compact 
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subset Q' of Qif . By Theorem 10.1 in Chapter IV of Ladyzhenskaya, Solonnikov and 
Uralceva [35], we can therefore conclude that $ G C^''^{Qr)- 

Step 3: Convergence from the Cylinder [0,T) x SSr to the State Space 

[o,r) X R" 

step 3.1: Setting 

Let > be a non decreasing sequence with limi^oo Ri oo and let 

be the sequence of stopping times defined as 

n := inf {i : X{t) ^ .^r^} 

Note that {rijjgpj is non decreasing and limi_s.oo Ti = oo. 

Denote by l>'^*' the limit of the sequence ($'^ ) on (0,r) x ^r-, i.e. 

V /fcGN 

l>«(t,x) = lim l>'=(i,x) \/{t,x) e (0,r) x (7.18) 



Step 3.2: Convergence of the sequence (<I>*^*M 

V /isN 

First, observe that the sequence (<I>(*^)igN is non increasing. Indeed, for i < j the 
stochastic control problem defined over (0,T) x 3§r- is nested into the stochastic 
control problem defined over (0,T) x .^^r.. In particular, a suboptimal strategy for 
the stochastic control problem defined over (0, T) x ^Sr^ would be to invest optimally 
while X G 3^R^ and then switch to the 0/3 policy h when x G ^R^X^^Ri- By the 
optimality principle, the expected total cost of such strategy is greater than the value 
function But this suboptimal strategy also corresponds to the optimal strategy 
for the stochastic control problem defined over (0, T) x £§r. . Hence 

{t, x) > l>'^') {t, x) <E N, \/{t, x) e (0, T) X ^R^ 

By the argument in Step 1, the sequence is also bounded. As a result, 

it converges to a limit $. This limit satisfies the boundary condition p.l7p . We now 
show that $ is C^'^ and satisfies the HJB PDE. These statements are local properties 
so we can restrict ourselves to a finite ball Qr. 

Step 3.3: Proving that $ £ C^''^{Qr) 

Now that we have shown the convergence of the sequence ( ^'^^M though a simple 

control-based argument, we can conclude the proof using the same arguments based 
on Ascoli's theorem as Fleming and Rishel (see [16], proof of Theorem 6.2 in Appendix 
E). 
□ 

Corollary 7.3 (Existence of a Classical Solution for the Risk-Sensitive Control 
Problem). The RS HJB PDE p.l2p with terminal condition $(T, x) — Inv has a 
solution $ e C^'^ Qo,T] x M") with $ continuous in [0,r] x R". 

8. Partial Observation. In this section we show how the results of the paper 
can be extended to the case where the factor process X{t) is not directly observed and 
the asset allocation strategy ht must be adapted to the filtration J^f ~ (T{Si{u),0 < 
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u < t,j — 0,...,m} generated by the asset price processes alone. In the hnear 
diffusion case studied by Nagai j32 and Nagai and Peng [33], the authors noted 
that the pair of processes {X{t),Y{t)), where Yi{t) = log Si{t), take the form of the 
'signal' and 'observation' processes in a Kalman filter system, and consequently the 
conditional distribution of X{t) is normal N{X{t),P{t)) where X{t) = E[X(t)|jf] 
satisfies the Kalman filter equation and P{t) is a deterministic matrix-valued function. 
By using this idea they obtain an equivalent form of the problem in which X(t) is 
replaced by X{t) and the dynamic equation (|2.3p by the Kalman filter. Optimal 
strategies take the form h{t, X{t)). This is in fact a very old idea in stochastic control, 
going back at least to Wonham [40]. 

8.1. Decomposition. At first sight it does not seem apparent that the same 
approach can be used here, as the price processes contain jumps, but a simple ob- 
servation shows that the jumps play no role in the estimation process, which is still, 
at base, the Kalman filter; see Proposition 18.11 below. A further complication is that 
the money market interest rate r{t) = ao + AQX{t) (see (|2.4p ) is observed directly 
and contains information about X{t). This was not the case in [35] and [33] where, 
in our notation, Aq — 0. We start by assuming that Aq = 0, and briefly discuss the 
extension to 7^ at the end of the section. 

Recall first that X{t) satisfies 

dX{t) ^ {b + BX{t))dt + AdW{t), X{0) = Xo (8.1) 

When Xt is observed, the initial value Xq is just a constant. In the present case we 
need to assume that Xq is a normal random vector iV(mo, Pq) with known mean mp 
and covariance Pq, and that Xq is independent of the processes W,Np. 

An application of the general Ito formulcH shows that for i = 1, . . . , m the log- 
prices Yi (t) satisfy Yi (0) = log Si and 



dY,{t) 



(a + AX(t)), --SS 



1 ^ 



(it + ^crifedl^fe(t) 



{\n{l +y,{z))--f,{z)}v{dz)dt+ / \n{l + j,{z)) N{dt,dz). (8.2) 



Proposition 8.1. Define processes Y^{t),Y'^{t) e R'" as follows. 
dY^{t) = AX{t) + J:dW{t), r/(0)=0, (8.3) 

dYj^it) = c,dt+ \n{l + -/{z))^N{dt,dz), i = l,...,m, y/(0)=logs. 



z 



with c e M™ defined by 



a, - isS^ + / {In (1 + 7,(z)) - j^{z)}u{dz) 



2 



Zo 



so that Y{t) = Y^{t) + r2(t). Also, define y,t = a{Y'{u),0 <u<t},i = l,2. Then 

(i) The processes Y^,Y^ are each adapted to the filtration J-f. 

(ii) For any bounded measurable function f and t > 0, 

E[/(x(t))|j-f] = E[/(x(t))|3;u]. 



^See 0ksendal and Sulem |42| for this calculation. 
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Proof, (i) S{t) and Y{t) are in 1-1 correspondence and therefore generate the 
same filtratfon J-f. Apart from rearrangement of deterministic terms, the decompo- 
sition Y = + Y"^ is the same as the standard decomposition Y — Y'^ + Y'^ of a 
semimartingale into its continuous and discontinuous components, see paragraph VI. 
37 of Rogers and Wilhams [35^ . 

(ii) and W{t) are independent and as a result yu and 3^2t are independent, and 
clearly J^^ — yu V 3^2*. The result follows, since X(t) is independent of 3^2*. Q 

8.2. Kalman Filter. The processes {X{t),Y^{t)) satisfying (HH) and (|H31) and 
the filtering equations, which are standard, are stated in the following proposition. 

Proposition 8.2 (Kalman Filter). The conditional distribution of X{t) given 
yit is N{X{t),P{t)), calculated as follows. 

(i) The innovations process U{t) G R™ defined by 

dU(t) = (EE^)^'/' {dY^it) - AX{t)dt), U{Q) = (8.4) 

is a vector Brownian motion. 

(ii) X{t) is the unique solution of the SDE 

dX{i) ^ {b + BX{t))dt+(^AJ:'^ + P(t)i^) (EE^)"^^^ dU{t), 1(0) = niQ. (8.5) 

(Hi) P{t) is the unique non-negative definite symmetric solution of the matrix Riccati 
equation 



P{t) - ASS^A^ - P(i)i^ (SS^)"' AP{t) + (^B- AE^ (EE^)"' Aj P{t) 
+P(t)(p^-i^(EE^)"'EA^) , P(0)=Po 

w/iere S / - E^ (E^E)"'e. 

To conclude, the Kalman filter has replaced our initial state process X{t) by an 
estimate X{t) with dynamics given in (18. 5p . To recover the asset price process, we 
use (|8.3|) together with (|8.4p to obtain the dynamics of Y{t): 

dY,{t) = dY^\t)+dY^{t) 

^a,+ AX{t)dt - ^EE^dt + (EE^) dU{t) 



+ / {ln(l + 7,(z))-7,(z)}i.(dz)+ / ln(l + 7(z)),iV(dt,dz). (8.6) 
We then apply Ito to Si (t) — exp Yi (t) to get 

^0- = ia + AXmdt + Y] f(EE^)'/'l dUk{t)+ [ -i,{z)N{dt,dz), 

S,iO) = s„ i^l,...,m (8.7) 
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We now solve the stochastic control problem with partial observation simply by 
replacing the original asset price description (|2.5p by (|8.7p . and the factor process 
description (|2.3p by the Kalman filter equation (I8.5p . in our solution of full observation 
case. The Kalman filter has time-varying coefficients, but this does not affect the 
preceding arguments. 

Finally, we briefiy sketch what to do if 7^ 0. We observe the short rate 
r(t) = ao + AQX{t), and hence the 1-dimensional statistic Yo{t) = exactly. 
We need to assume that this observation contains positive 'noise', i.e. AqAA^Aq > 
0. Changing coordinates if necessary, we can assume that Aq = (0, 0, . . . , 1) and 
hence Yo{t) = Xn{t). Our 'observation' is now the (m + l)-dimensional process 
Y — (Yq, . . . , Ym) and we can set up a Kalman filter system to estimate the unobserved 
states X = {Xi, . . . , Xn-i)'^ G R"~^. Ultimately, our optimal strategy will take 
the form h{t, Xi{t), X{t)), where X{t) is the Kalman filter estimate for X{t) given 
{Y{u),u < t}. The details are left to the reader. 

9. Conclusion. In this article, we extended the classical risk-sensitive asset 
management setting to include the possibility of infinite activity jumps in asset prices. 
We applied the change of measure technique proposed by Kuroda and Nagai |27] to 
derive the Hamilton- Jacobi-Bellman Partial Differential Equation associated with the 
control problem and then proved the existence and uniqueness of an admissible opti- 
mal control policy. Using an approximation in policy space algorithm, we established 
the existence of a classical C^'^ ((0,^) x R") solution and obtained the uniqueness 
of this solution through a verification result. This approach also extends naturally 
and with similar results to a jump-diffusion version of the risk-sensitive benchmarked 
asset management problem. 

Finally, we have observed that an attractive, if somewhat surprising, feature of 
the jump diffusion risk sensitive asset management is that it naturally prohibits any 
investment policy which may result in the investor's bankruptcy. In particular, in 
the risk-sensitive setting presented in this article, an investor who implements the 
optimal asset allocation is certain of remaining solvent over the investment horizon. 
This contrasts with the Merton type of approach in which the threat of bankruptcy 
remains present and has to be accounted for using a stopping time. 

The approach presented in this article extends naturally to a jump-diffusion ver- 
sion of the risk-sensitive benchmarked asset management problem introduced by Davis 
and Lleo |12| and would yield similar results, namely the existence of a unique ad- 
missible control policy and of a classical C^'^ solution to the associated RS HJB PDE. 

Appendix A. Admissibility of the Optimal Control Policy. The admis- 
sibility of the optimal control process h*{t) solving (|5.ip is linked to the existence 
of a probability measure P^. , which itself hinges on the characterisation as an ex- 

ponential martingale of the Radon-Nikodym derivative — Xt defined in (j3.5p 

via the Doleans exponential introduced in (|2.9p . In the setting of Kuroda and Na- 
gai |27| , the admissibility of the control follows easily from an argument in Gihman 
and Skhorokhod [2T which proves that the the Doleans exponential (here a Girsanov 
exponential with Gaussian integrand) is an exponential martingale. However, when 
the Doleans exponential does not have continuous path, as is the case in a jump dif- 
fusion setting, proving that it is indeed a martingale is more difficult. As noted by 
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Protter |3S], some partial results exist in this case (see for example Memin [3D1 and 
more recently Protter and Shimbo |34| ) . but none is as powerful as their counterparts 
in the continuous case, namely the Kamazaki or the Novikov conditions. 



To show that the Doleans exponential introduced in (|2.9I) is a martingale we will 
apply results derived by Memin f30|. We recall here the definition of the Doleans- 
Dade exponential as well as results from Memin [3^ (see also Exercise 13 in Chapter 
V of [3S]) on the multiplicative decomposition of local martingales that we will use to 
prove our point. 

Definition A.l (Doleans-Dade exponential). The Doleans-Dade exponential 
£{X){t) of a semimartingale X (t) is defined as 

£(X)(t)=exp|x(i)-i[X^X^]A n (l + AX,)e-^^= (A.l) 

^ 0<s<t 

Definition A. 2 (Memin's Additive Decomposition of Local Martingales). 
Let M(t) be a local martingale. We define an additive decomposition of M into 
two processes Mi{t) and IVhit), i.e. such that M{t) = Mi{t) + A-hit). 

In this decomposition, the process Mi{t) is defined as Mi{t) — L{t) — L{t) where 

Lit) - E ^^^^^{lAAf.ia} 
0<s<t 

and L(t) is the compensator of L{t). 

Proposition A. 3 (Memin's Proposition III-l). Let M(t) he a local martingale 
with additive decomposition as per definition \A.2\ and such that Mq = 0. Then 
(i) £{M) has the decomposition 

£{M) = £{M2)£{Mi) 

where 

~ , , , , V- AA-fi(s)AM2(s) 
M,(.).M,(t)- E i + AAf.() ' 

(ii) £{M2)Mi is a local martingale. 

(Hi) If AM{s) > -1 then AMi(s) > -1 for all finite s. 

Corollary A. 4 (Memin's Corollary III-2). Let N be a local martingale such 
that AN{s) > —1 for all finite s, and such that £{N{oo) is uniformly integrable. Let 
P' he the probability defined as 

Let Ni be a local martingale with locally integrable variations and denote by Ni 
the V -semimartingale defined as 

/ ^ ,r / ^ V- ANAs)AN{s) 
N,it)-mit)- E i;a^(,) ^ 

0<s<t ^ ' 
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then Ni is a P' local martingale, with locally integrable variations. Moreover, the P' 
predictable compensator o/^q^j,<-j |AA^i(s)| is equal to the P predictable compensator 
ofi:o<s<t\^N,{s)\. 

Theorem A. 5 (Memin's Theorem III-3). Let M{t) be a local martingale with 
additive decomposition as per definition \A.S\ If the predictable compensator of the 
process 



Y{t) = [M^,M%+ J2 \^M,{s)\ + {AAUs)f (A.2) 



0<s<t 0<s<t 

is bounded, then £{M){t) is uniformly integrable. 



Proof of Proposition \ 6. 3[ To prove that the control h*{t) is admissible, we need 
to show that the local martingale M* (t) defined as 

M*{t):^-e [ {h*{s)fY.dWs- [ [ \n{l-G{z,h*{s)))N{ds,dz) (A.3) 
Jo Jo Jz 

and such that 

mm = x*t 

is an exponential martingale. 

To achieve this objective, we will define a new class of control processes to which 
the optimal control belongs. We will start from the definition of a control ft, as a 
function: 

h:[0,T]xW-^J 

{t, x) i~> h{t, x) 



where the set J' was defined in (|2.7p . Based on this definition, the control space can 
be viewed as a functional space. 

Define the functional C{x,p, h) as 

C{x,p, h) := -i {9 + 1) h'^Y.Y.'^h - 9h^Y,k^p + h^ {a + At) 

^\ I { + h^^i^))'" - l] + Oh^^{z)lz,Xz)} v{dz) 

(A.4) 

where p e R" so that 

supLj** = {b + Bxf D<^ + {KK^D'^'^) - ^{D<^>fKK^D<^ + ao + Alx 



svLpC{x,D^,h) 
he J 



(A.5) 



and the unique maximizer of L'l^{t,x), h{t,x), is also the unique maximizer of 
C{x,D<i>,h). 
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Observe that with the choice of control function h°{t, x) :— V(t, x) e [0, T] x R", 
the functional C{x,p,h") — V{t,x,p) e [0,T] x M" x M". Invoking the optimality 
principle, we deduce that C{x, D^, h* {t, x)) > 0. 

Denote by the range of the control functions h{t,x) such that £{x,p,h) > 
0. Under Assumption [1] the set J', defined by p.7p . is in the interior of a hy- 
percube and since the functional C{x,p,h) is smooth, strictly concave in h and 
Imifi^Qj C{x,p,h) = — oo, we deduce that the set is a closed convex subset of 
J for all (t, x) G [0, T] x K". The control functions h take the form 

/i : [0,T] X K" ^ J C J 
(t, x) M> x) 

More formally, we can define a class H(r) of Markov control processes as 

Definition A. 6. A control process h(t) is in class 'H{T) if the following condi- 
tions are satisfied: 

1. h{t) is in class % introduced in Definition \2.1[ 

2. h{t, x)€j V(t, x) e [0, T] X R". 

In particular, we note that the optimal control process h*{t) £ Vt G [0,T] 

and Vcj G il. 

For any control policy h{t) G 'H{T), define the local martingale M{t) as 

M{t):=-e h{sf^dWs~J^ J In (l - G{z,h{s))^ N{ds,dz) (A.6) 
Also, let L{t) be the process defined as 



0<s<t 



JZ\Z: 



In {l - G{z,h{s))j N{ds,dz) 



where Zi = {z G Z : |Ai;| < ^,0 < s < i}. Then, the process Mi{t) := L{t) - L{t) 
can be expressed as: 



Mi{t) = ~ / In (l - G{z,h{s))\ N{ds,dz) 

<J <J Z\Zi 

To complete our decomposition of the local martingale M{t), we define the process 
M2{t) as 

M2{t) ^ M{t) ^ Mi{t) 

ln(l- 



Jz 



(l - G{z,h{s))^ N{ds,dz) 



The next step is to study each component of the process Y{t) defined in (|A.2P : 
• The process 



[M", M% = exp !^e^ /i(s)^SS'^/i(s)ds| 



is clearly bounded because h{s) G H(T) for all s G [0,t] 
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The process 



^ |AMi(s)|= f I \n(l-G{z,h{s))\ N{ds,dz) 

0<s<t JZ\Zi 



is bounded because h{s) G i-L{T) for all s £ [0,t]. In addition, the number of 
jumps greater than ^ is finite: 

# {0 < s < t; |AMi(s)|} = # {O < s < t; |AAf(s)|i||^j^,^|>i|} 



< oo 



-oo, 

' 2 



U 



2,oo 



Finally, we turn our attention to the process 



J2 iAAMs)f = f I 



0<s<t 



In (1 - G(0,/i(s))) N{ds,dz) 



Recalling that we assumed that in our setting 

\-^{z)\^v{dz) < oo 



and taking into consideration the fact that h{s) G 'H{T) for all s G [0,t], then 
we deduce that 



ln[l-G{z,h{s))j v{dz) <oo 
for any w G 51, which proves that 

\n{l -G{z,h{s))) ^ N{ds,dz) < oo 





By Theorem I A. 51 the Doleans-Dade exponential 

is uniformly integrable for all h G H{T). We can now apply Corollary I A.4I to formally 
define the measure P?. In particular, the measure P^. characterized via the Radon- 

Nikodym derivative Xt "^"^ll defined because h*{t) G H(T) Vcj G fi. This proves 

that the control h*{t) is admissible for all t G [0,T] and a; G il. 

□ 

Note that the control policy — corresponds to investing the entire wealth 
into the money market asset for the duration of the investment period. The associ- 
ated measure P^o is well defined and it is equal to the physical measure P. In fact, 
this proof not only shows that the optimal control process h*{t) is admissible, but 
also that a large class of "reasonable" control processes h{t) is also admissible and is 
associated with a well-defined probability measure. 
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Proof of Proposition lUm Consider the exponentially transformed problem iT^fh£A{T) Ji^, t, h) 
where 



J{x,t,h) InE 



^-91nV(t,x.h) 



(A.7) 



Note that because the term e ^(t.xji) bounded from below by 0, inf;,g_4cj-) J{x, t, h) 
is well defined which implies that there exists at least one minimizcr h. 



E 



^-einV(t,x,h) 



exp < 6* / g{s,Xs,h{s))ds — 9\nv 



(see for example Lemma 8.6.2. in [5T]) and hence 



inf E 

heA{T) 



nnV{t,x,h) 



heAiT) ' 



exp < 6* / g{s, Xs, h{s))ds — 9 In V 



I{v,x-h*{t)-t,T) 



which proves that the optimal control h*{t) for the auxiliary problem ^^VneAiT) H'^^j t, T) 
derived in Section [3.31 is indeed optimal for the problem sup^g^^j^) J{x,t, h). 
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